Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 698 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 8 |
| Duplicate rows (%) | 1.1% |
| Total size in memory | 94.3 KiB |
| Average record size in memory | 138.4 B |
Variable types
| Numeric | 9 |
|---|---|
| Categorical | 2 |
| Dataset has 8 (1.1%) duplicate rows | Duplicates |
Clump_Thickness is highly overall correlated with Uniformity_of_Cell_Size and 6 other fields | High correlation |
Uniformity_of_Cell_Size is highly overall correlated with Clump_Thickness and 7 other fields | High correlation |
Uniformity_of_Cell_Shape is highly overall correlated with Clump_Thickness and 6 other fields | High correlation |
Marginal_Adhesion is highly overall correlated with Clump_Thickness and 6 other fields | High correlation |
Single_Epithelial_Cell_Size is highly overall correlated with Clump_Thickness and 6 other fields | High correlation |
Bland_Chromatin is highly overall correlated with Clump_Thickness and 6 other fields | High correlation |
Normal_Nucleoli is highly overall correlated with Clump_Thickness and 7 other fields | High correlation |
Mitoses is highly overall correlated with Uniformity_of_Cell_Size and 2 other fields | High correlation |
Bare_Nuclei is highly overall correlated with Class | High correlation |
Class is highly overall correlated with Clump_Thickness and 8 other fields | High correlation |
Reproduction
| Analysis started | 2023-02-19 23:42:45.796734 |
|---|---|
| Analysis finished | 2023-02-19 23:42:54.293769 |
| Duration | 8.5 seconds |
| Software version | pandas-profiling v3.6.6 |
| Download configuration | config.json |
Sample_code_number
Real number (ℝ)
| Distinct | 644 |
|---|---|
| Distinct (%) | 92.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1071806.8 |
| Minimum | 61634 |
|---|---|
| Maximum | 13454352 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 61634 |
|---|---|
| 5-th percentile | 411453 |
| Q1 | 870258.25 |
| median | 1171710 |
| Q3 | 1238354 |
| 95-th percentile | 1333897.7 |
| Maximum | 13454352 |
| Range | 13392718 |
| Interquartile range (IQR) | 368095.75 |
Descriptive statistics
| Standard deviation | 617532.27 |
|---|---|
| Coefficient of variation (CV) | 0.57616007 |
| Kurtosis | 257.34775 |
| Mean | 1071806.8 |
| Median Absolute Deviation (MAD) | 104381 |
| Skewness | 13.665481 |
| Sum | 7.4812114 × 108 |
| Variance | 3.8134611 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1182404 | 6 | 0.9% |
| 1276091 | 5 | 0.7% |
| 1198641 | 3 | 0.4% |
| 897471 | 2 | 0.3% |
| 1114570 | 2 | 0.3% |
| 1100524 | 2 | 0.3% |
| 733639 | 2 | 0.3% |
| 1240603 | 2 | 0.3% |
| 1105524 | 2 | 0.3% |
| 704097 | 2 | 0.3% |
| Other values (634) | 670 |
| Value | Count | Frequency (%) |
| 61634 | 1 | |
| 63375 | 1 | |
| 76389 | 1 | |
| 95719 | 1 | |
| 128059 | 1 | |
| 142932 | 1 | |
| 144888 | 1 | |
| 145447 | 1 | |
| 160296 | 1 | |
| 167528 | 1 |
| Value | Count | Frequency (%) |
| 13454352 | 1 | |
| 8233704 | 1 | |
| 1371920 | 1 | |
| 1371026 | 1 | |
| 1369821 | 1 | |
| 1368882 | 1 | |
| 1368273 | 1 | |
| 1368267 | 1 | |
| 1365328 | 1 | |
| 1365075 | 1 |
Clump_Thickness
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.4169054 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.8176734 |
|---|---|
| Coefficient of variation (CV) | 0.6379293 |
| Kurtosis | -0.62613125 |
| Mean | 4.4169054 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.59336867 |
| Sum | 3083 |
| Variance | 7.9392834 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 145 | |
| 5 | 129 | |
| 3 | 108 | |
| 4 | 80 | |
| 10 | 69 | |
| 2 | 50 | 7.2% |
| 8 | 46 | 6.6% |
| 6 | 34 | 4.9% |
| 7 | 23 | 3.3% |
| 9 | 14 | 2.0% |
| Value | Count | Frequency (%) |
| 1 | 145 | |
| 2 | 50 | 7.2% |
| 3 | 108 | |
| 4 | 80 | |
| 5 | 129 | |
| 6 | 34 | 4.9% |
| 7 | 23 | 3.3% |
| 8 | 46 | 6.6% |
| 9 | 14 | 2.0% |
| 10 | 69 |
| Value | Count | Frequency (%) |
| 10 | 69 | |
| 9 | 14 | 2.0% |
| 8 | 46 | 6.6% |
| 7 | 23 | 3.3% |
| 6 | 34 | 4.9% |
| 5 | 129 | |
| 4 | 80 | |
| 3 | 108 | |
| 2 | 50 | 7.2% |
| 1 | 145 |
Uniformity_of_Cell_Size
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.1375358 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 5 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 3.0525753 |
|---|---|
| Coefficient of variation (CV) | 0.97292126 |
| Kurtosis | 0.09341847 |
| Mean | 3.1375358 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.2310346 |
| Sum | 2190 |
| Variance | 9.318216 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 383 | |
| 10 | 67 | 9.6% |
| 3 | 52 | 7.4% |
| 2 | 45 | 6.4% |
| 4 | 40 | 5.7% |
| 5 | 30 | 4.3% |
| 8 | 29 | 4.2% |
| 6 | 27 | 3.9% |
| 7 | 19 | 2.7% |
| 9 | 6 | 0.9% |
| Value | Count | Frequency (%) |
| 1 | 383 | |
| 2 | 45 | 6.4% |
| 3 | 52 | 7.4% |
| 4 | 40 | 5.7% |
| 5 | 30 | 4.3% |
| 6 | 27 | 3.9% |
| 7 | 19 | 2.7% |
| 8 | 29 | 4.2% |
| 9 | 6 | 0.9% |
| 10 | 67 | 9.6% |
| Value | Count | Frequency (%) |
| 10 | 67 | 9.6% |
| 9 | 6 | 0.9% |
| 8 | 29 | 4.2% |
| 7 | 19 | 2.7% |
| 6 | 27 | 3.9% |
| 5 | 30 | 4.3% |
| 4 | 40 | 5.7% |
| 3 | 52 | 7.4% |
| 2 | 45 | 6.4% |
| 1 | 383 |
Uniformity_of_Cell_Shape
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.2106017 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 5 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.9728667 |
|---|---|
| Coefficient of variation (CV) | 0.92595312 |
| Kurtosis | 0.0020723518 |
| Mean | 3.2106017 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.1597997 |
| Sum | 2241 |
| Variance | 8.8379362 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 352 | |
| 2 | 59 | 8.5% |
| 10 | 58 | 8.3% |
| 3 | 56 | 8.0% |
| 4 | 44 | 6.3% |
| 5 | 34 | 4.9% |
| 6 | 30 | 4.3% |
| 7 | 30 | 4.3% |
| 8 | 28 | 4.0% |
| 9 | 7 | 1.0% |
| Value | Count | Frequency (%) |
| 1 | 352 | |
| 2 | 59 | 8.5% |
| 3 | 56 | 8.0% |
| 4 | 44 | 6.3% |
| 5 | 34 | 4.9% |
| 6 | 30 | 4.3% |
| 7 | 30 | 4.3% |
| 8 | 28 | 4.0% |
| 9 | 7 | 1.0% |
| 10 | 58 | 8.3% |
| Value | Count | Frequency (%) |
| 10 | 58 | 8.3% |
| 9 | 7 | 1.0% |
| 8 | 28 | 4.0% |
| 7 | 30 | 4.3% |
| 6 | 30 | 4.3% |
| 5 | 34 | 4.9% |
| 4 | 44 | 6.3% |
| 3 | 56 | 8.0% |
| 2 | 59 | 8.5% |
| 1 | 352 |
Marginal_Adhesion
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.8094556 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.8566059 |
|---|---|
| Coefficient of variation (CV) | 1.0167827 |
| Kurtosis | 0.98105377 |
| Mean | 2.8094556 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.5223334 |
| Sum | 1961 |
| Variance | 8.1601974 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 406 | |
| 3 | 58 | 8.3% |
| 2 | 58 | 8.3% |
| 10 | 55 | 7.9% |
| 4 | 33 | 4.7% |
| 8 | 25 | 3.6% |
| 5 | 23 | 3.3% |
| 6 | 22 | 3.2% |
| 7 | 13 | 1.9% |
| 9 | 5 | 0.7% |
| Value | Count | Frequency (%) |
| 1 | 406 | |
| 2 | 58 | 8.3% |
| 3 | 58 | 8.3% |
| 4 | 33 | 4.7% |
| 5 | 23 | 3.3% |
| 6 | 22 | 3.2% |
| 7 | 13 | 1.9% |
| 8 | 25 | 3.6% |
| 9 | 5 | 0.7% |
| 10 | 55 | 7.9% |
| Value | Count | Frequency (%) |
| 10 | 55 | 7.9% |
| 9 | 5 | 0.7% |
| 8 | 25 | 3.6% |
| 7 | 13 | 1.9% |
| 6 | 22 | 3.2% |
| 5 | 23 | 3.3% |
| 4 | 33 | 4.7% |
| 3 | 58 | 8.3% |
| 2 | 58 | 8.3% |
| 1 | 406 |
Single_Epithelial_Cell_Size
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.217765 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 8 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.2154083 |
|---|---|
| Coefficient of variation (CV) | 0.68849288 |
| Kurtosis | 2.1606238 |
| Mean | 3.217765 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.709935 |
| Sum | 2246 |
| Variance | 4.908034 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 385 | |
| 3 | 72 | 10.3% |
| 4 | 48 | 6.9% |
| 1 | 47 | 6.7% |
| 6 | 41 | 5.9% |
| 5 | 39 | 5.6% |
| 10 | 31 | 4.4% |
| 8 | 21 | 3.0% |
| 7 | 12 | 1.7% |
| 9 | 2 | 0.3% |
| Value | Count | Frequency (%) |
| 1 | 47 | 6.7% |
| 2 | 385 | |
| 3 | 72 | 10.3% |
| 4 | 48 | 6.9% |
| 5 | 39 | 5.6% |
| 6 | 41 | 5.9% |
| 7 | 12 | 1.7% |
| 8 | 21 | 3.0% |
| 9 | 2 | 0.3% |
| 10 | 31 | 4.4% |
| Value | Count | Frequency (%) |
| 10 | 31 | 4.4% |
| 9 | 2 | 0.3% |
| 8 | 21 | 3.0% |
| 7 | 12 | 1.7% |
| 6 | 41 | 5.9% |
| 5 | 39 | 5.6% |
| 4 | 48 | 6.9% |
| 3 | 72 | 10.3% |
| 2 | 385 | |
| 1 | 47 | 6.7% |
Bare_Nuclei
Categorical
| Distinct | 11 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.8 KiB |
| 1 | |
|---|---|
| 10 | |
| 2 | 30 |
| 5 | 30 |
| 3 | 28 |
| Other values (6) |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.1891117 |
| Min length | 1 |
Characters and Unicode
| Total characters | 830 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 10 |
|---|---|
| 2nd row | 2 |
| 3rd row | 4 |
| 4th row | 1 |
| 5th row | 10 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 401 | |
| 10 | 132 | 18.9% |
| 2 | 30 | 4.3% |
| 5 | 30 | 4.3% |
| 3 | 28 | 4.0% |
| 8 | 21 | 3.0% |
| 4 | 19 | 2.7% |
| ? | 16 | 2.3% |
| 9 | 9 | 1.3% |
| 7 | 8 | 1.1% |
Length
| Value | Count | Frequency (%) |
| 1 | 401 | |
| 10 | 132 | 18.9% |
| 2 | 30 | 4.3% |
| 5 | 30 | 4.3% |
| 3 | 28 | 4.0% |
| 8 | 21 | 3.0% |
| 4 | 19 | 2.7% |
| 16 | 2.3% | |
| 9 | 9 | 1.3% |
| 7 | 8 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 533 | |
| 0 | 132 | 15.9% |
| 2 | 30 | 3.6% |
| 5 | 30 | 3.6% |
| 3 | 28 | 3.4% |
| 8 | 21 | 2.5% |
| 4 | 19 | 2.3% |
| ? | 16 | 1.9% |
| 9 | 9 | 1.1% |
| 7 | 8 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 814 | |
| Other Punctuation | 16 | 1.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 533 | |
| 0 | 132 | 16.2% |
| 2 | 30 | 3.7% |
| 5 | 30 | 3.7% |
| 3 | 28 | 3.4% |
| 8 | 21 | 2.6% |
| 4 | 19 | 2.3% |
| 9 | 9 | 1.1% |
| 7 | 8 | 1.0% |
| 6 | 4 | 0.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 16 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 830 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 533 | |
| 0 | 132 | 15.9% |
| 2 | 30 | 3.6% |
| 5 | 30 | 3.6% |
| 3 | 28 | 3.4% |
| 8 | 21 | 2.5% |
| 4 | 19 | 2.3% |
| ? | 16 | 1.9% |
| 9 | 9 | 1.1% |
| 7 | 8 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 830 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 533 | |
| 0 | 132 | 15.9% |
| 2 | 30 | 3.6% |
| 5 | 30 | 3.6% |
| 3 | 28 | 3.4% |
| 8 | 21 | 2.5% |
| 4 | 19 | 2.3% |
| ? | 16 | 1.9% |
| 9 | 9 | 1.1% |
| 7 | 8 | 1.0% |
Bland_Chromatin
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.4383954 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 8 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.4400564 |
|---|---|
| Coefficient of variation (CV) | 0.70964973 |
| Kurtosis | 0.17921858 |
| Mean | 3.4383954 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.0984966 |
| Sum | 2400 |
| Variance | 5.9538752 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 166 | |
| 3 | 164 | |
| 1 | 152 | |
| 7 | 73 | |
| 4 | 40 | 5.7% |
| 5 | 34 | 4.9% |
| 8 | 28 | 4.0% |
| 10 | 20 | 2.9% |
| 9 | 11 | 1.6% |
| 6 | 10 | 1.4% |
| Value | Count | Frequency (%) |
| 1 | 152 | |
| 2 | 166 | |
| 3 | 164 | |
| 4 | 40 | 5.7% |
| 5 | 34 | 4.9% |
| 6 | 10 | 1.4% |
| 7 | 73 | |
| 8 | 28 | 4.0% |
| 9 | 11 | 1.6% |
| 10 | 20 | 2.9% |
| Value | Count | Frequency (%) |
| 10 | 20 | 2.9% |
| 9 | 11 | 1.6% |
| 8 | 28 | 4.0% |
| 7 | 73 | |
| 6 | 10 | 1.4% |
| 5 | 34 | 4.9% |
| 4 | 40 | 5.7% |
| 3 | 164 | |
| 2 | 166 | |
| 1 | 152 |
Normal_Nucleoli
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.8696275 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 3.0550042 |
|---|---|
| Coefficient of variation (CV) | 1.0645995 |
| Kurtosis | 0.46782665 |
| Mean | 2.8696275 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.4200863 |
| Sum | 2003 |
| Variance | 9.3330504 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 442 | |
| 10 | 61 | 8.7% |
| 3 | 44 | 6.3% |
| 2 | 36 | 5.2% |
| 8 | 24 | 3.4% |
| 6 | 22 | 3.2% |
| 5 | 19 | 2.7% |
| 4 | 18 | 2.6% |
| 7 | 16 | 2.3% |
| 9 | 16 | 2.3% |
| Value | Count | Frequency (%) |
| 1 | 442 | |
| 2 | 36 | 5.2% |
| 3 | 44 | 6.3% |
| 4 | 18 | 2.6% |
| 5 | 19 | 2.7% |
| 6 | 22 | 3.2% |
| 7 | 16 | 2.3% |
| 8 | 24 | 3.4% |
| 9 | 16 | 2.3% |
| 10 | 61 | 8.7% |
| Value | Count | Frequency (%) |
| 10 | 61 | 8.7% |
| 9 | 16 | 2.3% |
| 8 | 24 | 3.4% |
| 7 | 16 | 2.3% |
| 6 | 22 | 3.2% |
| 5 | 19 | 2.7% |
| 4 | 18 | 2.6% |
| 3 | 44 | 6.3% |
| 2 | 36 | 5.2% |
| 1 | 442 |
Mitoses
Real number (ℝ)
| Distinct | 9 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.5902579 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 5 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.7161624 |
|---|---|
| Coefficient of variation (CV) | 1.0791724 |
| Kurtosis | 12.633842 |
| Mean | 1.5902579 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.5576034 |
| Sum | 1110 |
| Variance | 2.9452134 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 578 | |
| 2 | 35 | 5.0% |
| 3 | 33 | 4.7% |
| 10 | 14 | 2.0% |
| 4 | 12 | 1.7% |
| 7 | 9 | 1.3% |
| 8 | 8 | 1.1% |
| 5 | 6 | 0.9% |
| 6 | 3 | 0.4% |
| Value | Count | Frequency (%) |
| 1 | 578 | |
| 2 | 35 | 5.0% |
| 3 | 33 | 4.7% |
| 4 | 12 | 1.7% |
| 5 | 6 | 0.9% |
| 6 | 3 | 0.4% |
| 7 | 9 | 1.3% |
| 8 | 8 | 1.1% |
| 10 | 14 | 2.0% |
| Value | Count | Frequency (%) |
| 10 | 14 | 2.0% |
| 8 | 8 | 1.1% |
| 7 | 9 | 1.3% |
| 6 | 3 | 0.4% |
| 5 | 6 | 0.9% |
| 4 | 12 | 1.7% |
| 3 | 33 | 4.7% |
| 2 | 35 | 5.0% |
| 1 | 578 |
Class
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.7 KiB |
| 2 | |
|---|---|
| 4 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 698 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 4 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 457 | |
| 4 | 241 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 457 | |
| 4 | 241 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 457 | |
| 4 | 241 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 698 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 457 | |
| 4 | 241 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 698 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 457 | |
| 4 | 241 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 698 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 457 | |
| 4 | 241 |
| Sample_code_number | Clump_Thickness | Uniformity_of_Cell_Size | Uniformity_of_Cell_Shape | Marginal_Adhesion | Single_Epithelial_Cell_Size | Bland_Chromatin | Normal_Nucleoli | Mitoses | Bare_Nuclei | Class | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Sample_code_number | 1.000 | -0.004 | -0.044 | -0.061 | -0.050 | -0.088 | -0.096 | -0.072 | -0.076 | 0.000 | 0.000 |
| Clump_Thickness | -0.004 | 1.000 | 0.667 | 0.665 | 0.542 | 0.585 | 0.538 | 0.571 | 0.419 | 0.223 | 0.738 |
| Uniformity_of_Cell_Size | -0.044 | 0.667 | 1.000 | 0.892 | 0.742 | 0.787 | 0.720 | 0.757 | 0.509 | 0.287 | 0.875 |
| Uniformity_of_Cell_Shape | -0.061 | 0.665 | 0.892 | 1.000 | 0.712 | 0.759 | 0.693 | 0.725 | 0.473 | 0.278 | 0.860 |
| Marginal_Adhesion | -0.050 | 0.542 | 0.742 | 0.712 | 1.000 | 0.668 | 0.625 | 0.634 | 0.447 | 0.263 | 0.738 |
| Single_Epithelial_Cell_Size | -0.088 | 0.585 | 0.787 | 0.759 | 0.668 | 1.000 | 0.640 | 0.706 | 0.480 | 0.270 | 0.791 |
| Bland_Chromatin | -0.096 | 0.538 | 0.720 | 0.693 | 0.625 | 0.640 | 1.000 | 0.663 | 0.387 | 0.255 | 0.804 |
| Normal_Nucleoli | -0.072 | 0.571 | 0.757 | 0.725 | 0.634 | 0.706 | 0.663 | 1.000 | 0.504 | 0.251 | 0.767 |
| Mitoses | -0.076 | 0.419 | 0.509 | 0.473 | 0.447 | 0.480 | 0.387 | 0.504 | 1.000 | 0.193 | 0.519 |
| Bare_Nuclei | 0.000 | 0.223 | 0.287 | 0.278 | 0.263 | 0.270 | 0.255 | 0.251 | 0.193 | 1.000 | 0.834 |
| Class | 0.000 | 0.738 | 0.875 | 0.860 | 0.738 | 0.791 | 0.804 | 0.767 | 0.519 | 0.834 | 1.000 |
| Sample_code_number | Clump_Thickness | Uniformity_of_Cell_Size | Uniformity_of_Cell_Shape | Marginal_Adhesion | Single_Epithelial_Cell_Size | Bare_Nuclei | Bland_Chromatin | Normal_Nucleoli | Mitoses | Class | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1002945 | 5 | 4 | 4 | 5 | 7 | 10 | 3 | 2 | 1 | 2 |
| 1 | 1015425 | 3 | 1 | 1 | 1 | 2 | 2 | 3 | 1 | 1 | 2 |
| 2 | 1016277 | 6 | 8 | 8 | 1 | 3 | 4 | 3 | 7 | 1 | 2 |
| 3 | 1017023 | 4 | 1 | 1 | 3 | 2 | 1 | 3 | 1 | 1 | 2 |
| 4 | 1017122 | 8 | 10 | 10 | 8 | 7 | 10 | 9 | 7 | 1 | 4 |
| 5 | 1018099 | 1 | 1 | 1 | 1 | 2 | 10 | 3 | 1 | 1 | 2 |
| 6 | 1018561 | 2 | 1 | 2 | 1 | 2 | 1 | 3 | 1 | 1 | 2 |
| 7 | 1033078 | 2 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 5 | 2 |
| 8 | 1033078 | 4 | 2 | 1 | 1 | 2 | 1 | 2 | 1 | 1 | 2 |
| 9 | 1035283 | 1 | 1 | 1 | 1 | 1 | 1 | 3 | 1 | 1 | 2 |
| Sample_code_number | Clump_Thickness | Uniformity_of_Cell_Size | Uniformity_of_Cell_Shape | Marginal_Adhesion | Single_Epithelial_Cell_Size | Bare_Nuclei | Bland_Chromatin | Normal_Nucleoli | Mitoses | Class | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 688 | 654546 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 8 | 2 |
| 689 | 654546 | 1 | 1 | 1 | 3 | 2 | 1 | 1 | 1 | 1 | 2 |
| 690 | 695091 | 5 | 10 | 10 | 5 | 4 | 5 | 4 | 4 | 1 | 4 |
| 691 | 714039 | 3 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 2 |
| 692 | 763235 | 3 | 1 | 1 | 1 | 2 | 1 | 2 | 1 | 2 | 2 |
| 693 | 776715 | 3 | 1 | 1 | 1 | 3 | 2 | 1 | 1 | 1 | 2 |
| 694 | 841769 | 2 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 2 |
| 695 | 888820 | 5 | 10 | 10 | 3 | 7 | 3 | 8 | 10 | 2 | 4 |
| 696 | 897471 | 4 | 8 | 6 | 4 | 3 | 4 | 10 | 6 | 1 | 4 |
| 697 | 897471 | 4 | 8 | 8 | 5 | 4 | 5 | 10 | 4 | 1 | 4 |
Most frequently occurring
| Sample_code_number | Clump_Thickness | Uniformity_of_Cell_Size | Uniformity_of_Cell_Shape | Marginal_Adhesion | Single_Epithelial_Cell_Size | Bare_Nuclei | Bland_Chromatin | Normal_Nucleoli | Mitoses | Class | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 320675 | 3 | 3 | 5 | 2 | 3 | 10 | 7 | 1 | 1 | 4 | 2 |
| 1 | 466906 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 2 | 2 |
| 2 | 704097 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 2 | 2 |
| 3 | 1100524 | 6 | 10 | 10 | 2 | 8 | 10 | 7 | 3 | 3 | 4 | 2 |
| 4 | 1116116 | 9 | 10 | 10 | 1 | 10 | 8 | 3 | 3 | 1 | 4 | 2 |
| 5 | 1198641 | 3 | 1 | 1 | 1 | 2 | 1 | 3 | 1 | 1 | 2 | 2 |
| 6 | 1218860 | 1 | 1 | 1 | 1 | 1 | 1 | 3 | 1 | 1 | 2 | 2 |
| 7 | 1321942 | 5 | 1 | 1 | 1 | 2 | 1 | 3 | 1 | 1 | 2 | 2 |